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CBX5, CBX1, and CBX3 (HPla, (3, and 7, respectively) play an evolutionarily conserved role in the formation and 
maintenance of heterochromatin. In addition, CBX5, CBX1, and CBX3 may also participate in transcriptional regulation of 
genes. Recently, CBX3 binding to the bodies of a subset of genes has been observed in human and murine cells. However, 
the generality of this phenomenon and the role CBX3 may play in this context are unknown. Genome-wide localization 
analysis reveals CBX3 binding at genie regions, which strongly correlates with gene activity across multiple cell types. 
Depletion of CBX3 resulted in down-regulation of a subset of target genes. Loss of CBX3 binding leads to a more dramatic 
accumulation of unspliced nascent transcripts. In addition, we observed defective recruitment of splicing factors, including 
SNRNP70, to CBX3 target genes. Collectively, our data suggest a role for CBX3 in aiding in efficient cotranscriptional 
RNA processing. 



[Supplemental material is available for this article.] 

Once thought of as static components of heterochromatin, het- 
erochromatin protein 1 family members (HP Is — CBX5, CBX1, 
and CBX3) are now known to function in a myriad of cellular 
processes, including DNA repair, gene regulation, and telomere 
function (Maison and Almouzni 2004; Hediger and Gasser 2006; 
Lomberk et al. 2006; Kwon and Workman 2008). SU(VAR)205 
(HPla), HP lb, and HPlc were first characterized as suppressors of 
position effect variegation (PEV) in Drosophila and, along with 
H3K9 methylation, were regarded as a hallmark of heterochro- 
matin or repressed/silenced regions of the genome (Dillon 2004; 
Huisinga et al. 2006; Lomberk et al. 2006). More recently, CBX5, -1, 
and -3 have been assigned roles in repression of transcription at 
euchromatic genes (Hiragami and Festenstein 2005; Hediger and 
Gasser 2006; Kwon and Workman 2008). CBX proteins interact 
with and are recruited to specific loci by repressor proteins such as 
Retinoblastoma (Rbl) (Nielsen et al. 2001), IKZF1 (Ikaros) (Brown 
et al. 1997), and TRIM28 (KAP1/TIF0) (Ryan et al. 1999; Schultz 
et al. 2002). CBX1 has been implicated in silencing of POU5F1 
during early myogenesis (Feldman et al. 2006). CBX3 has been 
linked to silencing at E2F- and MYC-responsive genes in G 0 cells 
(Ogawa et al. 2002) and the repression of the progestin responsive 
MMTV promoter in breast cancer cells (Vicent et al. 2006). In con- 
junction with SUV39H1, CBX3 is also responsible for chromatin- 
mediated repression of the HIV-1 gene (du Chene et al. 2007). Most 
recently, CBX3 recruitment, along with DNA and histone meth- 
ylation, has been implicated in PIAS1 -mediated FOXP3 repression 
in T-cell differentiation (Liu et al. 2010). Biochemical studies have 
shed light on potential mechanisms of action for CBX5, CBX1, and 
CBX3 in this context (Smallwood et al. 2007, 2008). 

In recent years, evidence has accumulated to implicate HP1 
proteins in transcriptional activation. In Drosophila, cytological 
and genome-wide location studies demonstrated that SU(VAR)205 
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binds to large domains in pericentric regions, repeat dense regions, 
transposable elements, and the entire X chromosome in male flies 
(Fanti et al. 2003; Greil et al. 2003; de Wit et al. 2005, 2007). In 
addition, SU(VAR)205 associated with exon-dense active genes on 
chromosome arms (Greil et al. 2003; de Wit et al. 2005, 2007). 
SU(VAR)205 was shown to be necessary for transcription of het- 
erochromatin associated genes (de Wit et al. 2007). SU(VAR)205 
was also found to positively regulate transcript levels in euchro- 
matin, most notably at the Hsp70 heat shock puff loci (Piacentini 
et al. 2003; Cryderman et al. 2005). Most recently, SU(VAR)205 has 
been found associated with the promoters of genes with stalled RNA 
polymerase II (Yin et al. 2011). In contrast, a separate study dem- 
onstrated that active genes bound by SU(VAR)205 have a lower 
expression level than unbound genes, suggesting a negative role in 
transcription for SU(VAR)205 (de Wit et al. 2005). HPlc was also 
mapped to ~ 150 targets in active euchromatin (Greil et al. 2003). It 
has also been suggested that HPlc can colocalize with poised Ser-5 
phosphorylated RNA polymerase II (RNAP II) (Font-Burgada et al. 
2008). More recently, it was demonstrated that HPlc plays a role in 
recruitment of the FACT histone chaperone complex to active re- 
gions of the Drosophila genome functioning to link RNAP II and 
FACT (Kwon et al. 2010). 

In murine cells, CBX3 was found localized to the coding re- 
gions of several genes (Vakoc et al. 2005). In addition, a switch has 
been observed from CBX3 binding at active genes to CBX5 and/or 
CBX1 binding inactive genes in human cells (Smallwood et al. 
2007; Mateescu et al. 2008). This suggests distinct and comple- 
mentary roles for different members of this family of proteins. 
However, low-resolution studies in the mouse and humans sug- 
gests that CBX5, -1, and -3 may have similar overlapping binding 
profiles (Vogel et al. 2006). It was previously shown that concur- 
rent overexpression of CBX5, -1, and -3 in ECR-293 cells resulted in 
up-regulation of four genes, although it was not determined if 
these were direct effects (Hwang and Worman 2002). In addition, 
CBX3 has been implicated in alternative splicing for a small subset 
of genes (Saint-Andre et al. 2011). To date, the exact nature and 
mechanism by which heterochromatin protein 1 family members 
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influence gene regulation remains un- 
clear. Genome-wide knowledge of spe- 
cific gene targets would be an invaluable 
tool for understanding the mechanism of 
CBX3 gene regulation. 

In the current study, we compre- 
hensively determined CBX3 binding sites 
in the human genome, with the use of 
genome-wide location analysis, to gain 
a clearer picture of CBX3 action in gene 
regulation. We demonstrated that CBX3 
localizes to the gene body of active genes 
in multiple cell lines and that its binding 
positively correlates with gene activity. 
Loss of CBX3 results in a decrease in 
transcript level of a specific subset of tar- 
get genes. Surprisingly, CBX3 is enriched 
at exonic sequences, and loss of CBX3 
binding led to an increase in unspliced 
transcripts of its target genes. Loss of 
CBX3 also diminished the recruitment of 
splicing factors to gene bodies, consistent 
with the observed splicing defects. Thus, 
CBX3 appears to promote optimal cotran- 
scriptional RNA processing through re- 
cruitment of the splicing machinery. Taken 
together, this study brings to light a novel 
role for the heterochromatin protein CBX3 
in gene regulation, providing a functional 
link between transcription, chromatin, 
and RNA processing. 
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Results 

Genome-wide localization of CBX3 
to the gene body of active genes 

CBX3 was previously found in the gene 
body of a few active mammalian genes 
(Vakoc et al. 2005; Smallwood et al. 2007). 
To get a clearer picture of CBX3 localiza- 
tion on a genome-wide basis, we per- 
formed ChlP-chip and ChlP-seq analysis 
in the human colorectal carcinoma cell 
line HCT116 Qohnson et al. 2007). Two 
biological replicates of CBX3 immuno- 
precipitations were analyzed by ChlP- 
seq, and the results correlated well. In 
addition, two independent biological rep- 
licates were used for ChlP-chip analysis 
with the NimbleGen and ENCODE plat- 
forms; all analyses yielded the same re- 
sults. We found CBX3 binding was com- 
monly confined within the transcribed 
gene body but was also found spanning 

larger chromosomal regions encompassing one or more genes (Sup- 
plemental Fig. SI A). In fact, analysis revealed that genome-wide 89% 
of CBX3 occupied sequences are intragenic (Fig. 1A). To determine if 
CBX3 binding correlates with gene activity, we grouped HCT116 
genes into three tiers by expression. Enrichment of CBX3 binding 
was then calculated for a region encompassing 10 kb upstream of the 
transcription start site (TSS) and 10 kb downstream from the 3' end. 
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Figure 1 . CBX3 localization to gene bodies correlates with gene expression in multiple cell types. (A) 
Distribution of ChlP-seq CBX3 targets between genie and nongenic regions in HCT116 cells. (B) En- 
richment of CBX3 to top, middle, and bottom third expressed genes from 10 kb upstream to 10 kb 
downstream from gene body. Average enrichment in 500-bp bins every 1 kb. Random region included 
as a control. (C) Venn diagram showing the overlap of ChlP-chip (NimbleGen arrays) CBX3 targets 
between K562, MDA-MB-231 , and HCT1 1 6 cells. CBX3 target genes are called for each cell type based 
on whether they have CBX3 peaks within their gene bodies. (D) Log transformed expression levels, from 
Affymetrix arrays, for K562 and MDA-MB-231 genes clustered from highest to lowest expression (Expr.). 
Heatmap of CBX3 enrichment, by ChlP-chip, for clustered genes, from 10 kb upstream of the tran- 
scription start site (TSS) to 1 0 kb downstream from the transcription end site (TES). For each gene we 
divide the gene body into 50 equal divisions and use the log 2 ratio of the nearest probe on the 
microarray to estimate the binding strength for every boundary position between these divisions. 
(£) ChlP-seq data visualized in UCSC browser snapshots showing CBX3 localization to the IL8 and IRF1 
genes, in HCT1 1 6 cells treated ± TNF for 30 min. 



The top third expressed genes were enriched for CBX3 binding, 
whereas the bottom third unexpressed genes were not bound, with 
enrichment peaking at the 3' end of the gene body and then tapering 
off (Fig. IB). We also performed ChlP-chip analysis with two other 
cell types, K562 (erythroid) and MDA-MB-231 (breast), using Nim- 
bleGen HD2 genome-wide tiling arrays. We found 7996 and 6241 
CBX3-bound targets in K562 and MDA-MB-231 cells, respectively, 
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compared with 8336 targets in HCT116 cells (Fig. 1C). CBX3 
binding was likewise correlated with gene expression levels in 
K562 and MDA-MB231, demonstrating this is general phenome- 
non (Fig. ID). For example, TIMP3, which has been implicated in 
breast cancer progression, was only expressed in MDA-MB-231 
cells, where it is bound by CBX3 (Supplemental Fig. SIB). In fact, 
over 90% of CBX3 targets for all cell types tested are expressed in 
the corresponding cells. Our data expand upon an initial obser- 
vation, demonstrating that CBX3 localization to the body of active 
genes is a genome-wide phenomenon. We therefore hypothesized 
that CBX3 is playing a functional role in gene regulation. 

Dynamic recruitment of CBX3 

The experiments described above demonstrate that CBX3 is pres- 
ent in the gene body of active genes in unstimulated cells. How- 
ever, it is not clear whether CBX3 is recruited upon initiation of 
transcription or is present at genes prior to activation. To address 
this question, we performed ChlP-chip analysis of CBX3 localiza- 
tion before or after stimulation of K562, HCT116, or HeLa cells 
with TNF or IFNG. Using a custom ENCODE tiling microarray 
covering —1% of the human genome (Kim et al. 2005a), we mapped 
the binding sites for CBX3 in HCT116 and HeLa cells both before 
and after IFNG and TNF treatment. Additionally, we determined 
gene expression profiles for each cell type under each condition 
by microarray experiments. We found that induction of transcrip- 
tion was correlated with a significant increase in CBX3 binding to 
the ENCODE arrays (Supplemental Fig. SIC; data not shown). Next, 
we performed ChlP-seq analysis on TNF-treated HCT116 cells and 
found that CBX3 was specifically and significantly recruited to 
many TNF-responsive genes (increased CBX3 binding was seen at 
336 of 523 up-regulated transcripts). Representative browser shots 
showing CBX3 occupancy at the IL8 and IRF1 loci, before or after 
TNF treatment, are shown (Fig. IE). 

CBX3 interacts with elongating forms of RNA polymerase II 

It was previously reported that RNA polymerase II could be coim- 
munoprecipitated from nuclear extract (NE) with CBX3 (Vakoc 
et al. 2005). Indeed, CBX3 enrichment to gene bodies has been 
shown to be abrogated by treatment with the P-TEFb inhibitor 
DRB, which prevents RNA polymerase II elongation on active 
genes (Vakoc et al. 2005; J. Espinosa, unpubl.). We performed 
coimmunoprecipitation assays using HeLa extracts and antibodies 
against CBX3. The results demonstrate that CBX3 interacts with 
the phosphorylated forms of RNA polymerase II (Supplemental 
Fig. SID). This result suggests that CBX3 localization to active 
genes could, at least in part, be mediated by its association with the 
elongating form of RNA polymerase II. However, we also note that 
CBX3 is not present at all active genes, suggesting that additional 
factors may affect CBX3 recruitment. Since CBX3 is known to bind 
to H3K9me2/3 via its chromodomain, we first asked whether this 
could also be the mechanism for its localization to gene bodies. 
However, we were not able to detect significant H3K9me2/3 at 
CBX3 target genes (data not shown). This is in agreement with HP1 
and SU(VAR)3-9 localizing together at pericentric regions but in- 
dependently at many other regions in Drosophila (Greil et al. 2003). 
Next, we tested the possibility that CBX3 could be binding to or 
stabilized by the H3K36me3 modification, since CBX3 binding 
was strongly correlated with histone H3K36me3 enrichment in 
multiple cell types (data not shown). Biotinylated histone H3 tail 
peptides with K9me3, K36me3, or no modification were incubated 
with purified CBX3 (Supplemental Fig. S1E). As expected, CBX3 



binds with high affinity to H3K9me3 peptides. However, no 
binding to the H3K36me3 modified peptide was detected. 

CBX3 positively regulates transcript levels of its target genes 

The data presented thus far suggest that CBX3 is recruited to genes 
upon activation and could therefore be playing a role in gene 
regulation. However, it is unclear if CBX3 is acting to increase or 
decrease transcriptional output at its target loci. To address this 
issue, we utilized an HCT116 cell line stably expressing a CBX3- 
specific shRNA construct (KD HCT116). CBX3 knockdown was 
validated by RT-qPCR and Western blot analyses (Supplemental 
Fig. S2A). In all subsequent experiments, the data presented as KD 
HCT116 cells is from analysis of clone 10 (clone 8 was also ana- 
lyzed with similar results to clone 10; data not shown). Gene 
expression levels in wild- type (WT) and KD HCT116 cells were 
analyzed by Affymetrix exon arrays and RNA-seq, yielding com- 
parable results. To determine the effect of CBX3 depletion on 
transcription, we calculated the expression of all genes in WT and 
KD HCT116 cells using the RNA-seq data set. Genes were grouped 
as those either bound or unbound by CBX3 in WT cells, and then 
expression profiles of these gene groups were compared in WT and 
KD cells. CBX3 -bound genes are expressed at much higher levels 
than unbound targets, red and blue lines versus green and black 
lines (Fig. 2A). Notably, there was a subset of genes unbound by 
CBX3 but still significantly expressed; these will be discussed later. 
Surprisingly, despite widespread CBX3 binding to actively tran- 
scribed genes, we did not observe large-scale alteration of transcript 
levels; compare red to blue lines and green to black (Fig. 2A). More 
detailed analysis revealed that a small number of genes are in fact 
differentially expressed (367 down-regulated and 340 up-regulated, 
by more than twofold) in the KD cells. Genes that were down- 
regulated upon loss of CBX3 were enriched for CBX3 binding in 
WT HCT116 cells when compared to genes up-regulated (Fig. 2B). 
The correlation between loss of CBX3 binding and a decrease in 
gene expression suggests that CBX3 can positively regulate tran- 
scription. Results were validated for several genes, including EHF 
and GPR1 1 0, by RT-qPCR and RNA-seq analysis in WT and KD cells 
(Fig. 2C; Supplemental Fig. S2B,C). Of interest, CBX3 is enriched 
across gene clusters, such as the KLK (kallikrein serine proteases) 
family on chromosome 19 whose up-regulation, like that of CBX3, 
has been implicated in carcinogenesis. Loss of CBX3 resulted in 
a coordinated down-regulation of the KLK genes (Fig. 2D). 

We analyzed the expression levels of several TNF-responsive 
genes in both WT and KD HCT116 cells by RT-qPCR and/or RNA- 
seq. As depicted in Figure 2E and Supplemental Figure S3A, mul- 
tiple TNF-responsive genes are also negatively affected by loss of 
CBX3. We observed that expression of a subset of TNF-responsive 
genes, such as NFKB1 and ZC3H12C, was not altered in KD cells. 
However, these genes possess only background levels of CBX3 oc- 
cupancy that does not increase upon gene activation (Supplemental 
Fig. S3B; data not shown). 

Loss of CBX3 caused only mild transcriptional defects for 
a subset of genes. This suggests that the primary function of CBX3 
binding to gene bodies may not be to directly regulate tran- 
scription levels per se. We therefore considered the possibility 
that CBX3 was influencing multiple facets of gene regulation. 

CBX3 preferentially associates with exons in multi-exonic genes 

Further analysis of CBX3 targets revealed that the bound genes 
were significantly longer than unbound genes (27227 bp versus 
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Figure 2. CBX3 target genes are down-regulated in KD HCT116 cells. (A) Comparison of Log 2 
(RPKM expression) profiles for genes in KD and WT cells, grouped as either bound or unbound by 
CBX3 (in \A/T cells). Bound WT red line, bound KD blue line, unbound WT green line, and unbound KD 
black line. (B) CBX3 enrichment, from WT cells, for genes up- or down-regulated in CBX3 KD HCT1 1 6 
cells. Log 2 CBX3 (IP/input), from CBX3 ChlP-seq in WT HCT116 cells, binned as in Figure IB with 
random regions as a control. (C, left) ChlP-seq UCSC Genome Browser snapshot of CBX3 enrichment 
at the EHF gene. (Middle) RT-qPCR analysis of EHF expression in WT and KD HCT1 1 6 cells. (Right) RNA- 
seq UCSC Genome Browser snapshot of EHF expression level in WT and KD cells. (D, left) CBX3 
binding. (Right) RNA-seq browser shot (as in C) for KLKgene cluster, (f, left) RT-qPCR analysis of TNF- 
responsive CBX3 target IL8 in KD and WT HCT116 cells treated ± TNF for 30 min. (Right) UCSC 
Genome Browser snapshot of RNA-seq IL8 expression. RT-qPCR expression levels normalized to 
GAPDH and are the average of three biological replicate experiments. Error bars calculated as the 
standard deviation between replicates. (***) P < 0.001 . 



when grouped by exon number. Inter- 
estingly, in contrast to all other groups, 
single-exon genes were highly expressed 
despite a lack of CBX3 enrichment (Sup- 
plemental Fig. S4A). These data suggest 
that expression of longer, multi-exon 
genes is more dependent on CBX3 bind- 
ing than that of single-exon genes. In 
support of this idea, the mean exon num- 
ber for the group of expressed unbound 
genes, mentioned in Figure 2A, is one. 
Next we created a heatmap of average 
CBX3 enrichment at the 5 ' of exons (and 
±2.5 kb upstream and downstream) 
ranked from low (black bar), to middle 
(green bar), and to high (red bar) expres- 
sion. We observed that CBX3 is enriched 
at exon boundaries (Fig. 3E). In addition, 
we plotted CBX3 enrichment (tag counts 
per kilo base) at exons versus introns for 
all genes and those with high, medium, 
or low CBX3 binding (Supplemental Fig. 
S4B). While CBX3 binds to introns, it is 
clearly enriched in exons. Increased CBX3 
enrichment is correlated with increased 
exon expression levels (Fig. 3E,F). Collec- 
tively, these data suggest that CBX3 could 
be playing a role in RNA processing. 

Recently, CBX3 has been implicated 
in the regulation of alternative splic- 
ing with CBX3 enriched at alternatively 
spliced exons of CD44 (Saint- Andre et al. 
2011). In contrast, CBX3 is evenly re- 
cruited to CD44 in HCT116 cells, and 
there was no change in splicing of this 
gene between WT and KD cells (data not 
shown). To further investigate, we calcu- 
lated CBX3 enrichment (RPKM value) for 
exons included in mature transcripts 
(spliced in) versus those not included 
(spliced out) in HCT116 cells. We ob- 
served that CBX3 enrichment is not sig- 
nificantly different for alternatively spliced 
or constitutive exons (Supplemental Fig. 
S4C). Additionally, loss of CBX3 did not 
decrease the inclusion of 219 alternatively 
spliced exons examined. Intriguingly, we 
found that loss of CBX3 results in an in- 
creased inclusion of exons usually spliced 
out (Fig. 3F). The preference for exon-rich 
genes and enrichment at exons coupled 



14161 bp; P < A.77 X 1(T 283 , Wilcoxon rank sum test) (Fig. 3A). 
Additionally, CBX3 -bound genes have a median exon number of 
10, whereas unbound genes have a median of five (P- value 1 x 
10" 300 , Wilcoxon rank sun test) (Fig. 3B). Next we calculated CBX3 
occupancy at genes grouped by exon number, from one to 21+. 
CBX3 binding clearly favors multi-exon genes over single-exon 
genes (Fig. 3C; Supplemental Fig. S4A). In addition, we calculated 
expression levels of genes grouped by exon number (Fig. 3D; Sup- 
plemental Fig. S4A). Overlay of these two graphs revealed that CBX3 
binding correlated with expression level of genes, for the most part, 



with increased exon inclusion in the ab- 
sence of CBX3 led us to consider the possibility that CBX3 could be 
regulating RNA processing generally but not alternative splicing 
specifically in our system. 

Loss of function of CBX3 causes a disruption in splicing 
of target genes 

To test whether CBX3 indeed participates in RNA processing, we 
first examined the IL8 gene. We designed primer pairs spanning 
each exon-intron junction as well as one exon-exon junction. 
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Figure 3. CBX3 binds to longer, exon-rich genes. (A) Box plot of Log 2 (gene length) of CBX3 bound or 
unbound genes in WT HCT1 1 6 cells. P = 4.77 x 1 0 283 . (B) Box plot of median exon number for CBX3 
bound and unbound genes. P=1X1 0~ 300 . (C) Box plot of CBX3 enrichment, log 2 (CBX3/input), for 
genes grouped by exon number from 1 to 21 +. (D) Box plot showing RPKM expression, in WT cells, of 
genes grouped by exon number from 1 to 21 +. (f ) Heatmap of CBX3 enrichment 2.5 kb upstream of 
and downstream from TSS distal exons in WT HCT116 cells. Signal normalized to input and targets 
clustered by exon expression from lowest to highest. (F) Average CBX3 enrichment at high (red), middle 
(mid-green), and low (black) third expressed exons. (C) Comparison of RPKM expression for alterna- 
tively spliced exons (alt out) in WT and KD cells. P-value for "alt out" exon in WT versus KD 2.771 x 
10 . For box plots, the median is represented by the center line; the edges of the box represent the 
25th and 75th percentiles; and the whiskers extend to non-outlier extreme data points. P-values 
obtained using the Wilcoxon rank sum test. 



Exon-intron spanning primers will amplify unspliced transcripts, 
whereas exon-exon spanning primers only measure spliced tran- 
scripts. This approach allowed us to compare the expression level 
of spliced and unspliced IL8 mRNAs in WT and CBX3 KD HCT1 1 6 
cells. We calculated the ratio of intron (exon-intron junction) to 
exon (exon-exon junction) expression across the IL8 gene for WT 
and KD cells (Fig. 4 A, left). In addition, the fold change of intron 



expression, or the level of unspliced 
transcript, in WT to KD cells was analyzed 
(Fig. 4A, right). We detected a significant 
increase in intron inclusion in KD HCT116 
cells at every splice junction across the 
IL8 gene (P < 0.05, t-test). In fact, loss of 
CBX3 leads to a two- to fivefold increase 
in unspliced IL8 transcripts in the KD 
HCT1 16 cells. 

To ascertain if this was a common 
effect of CBX3 loss, we analyzed intron 
inclusion in several additional TNF- 
responsive genes, including ILIA, EHD1, 
and LTE (Fig. 4B; Supplemental Fig. S5A). 
As observed for IL8, loss of CBX3 resulted 
in an increase in unspliced transcripts for 
all CBX3 target genes tested. The corre- 
lation between loss of CBX3 and increase 
in intron inclusion was also seen for the 
CBX3 -responsive gene HOXA9 in unstim- 
ulatedcells (Fig. 4C). However, no increase 
in unspliced transcripts was observed for 
genes not bound by CBX3, such as 
NFKB1 and ZC3H12C (Fig. 4D; Supple- 
mental Fig. S5B); these genes show no 
disruption of RNA processing. There- 
fore, the RNA processing defect we ob- 
serve correlates with a decrease in CBX3 
binding. 

Loss of CBX3 results in global RNA 
processing defect 

To further investigate the role of CBX3 in 
RNA processing, we performed genome- 
wide analysis of both poly-A and non- 
poly-A transcript levels from WT and KD 
HCT116 cells. Total RNA was isolated 
from WT and KD HCT116 cells and sub- 
jected to high-throughput sequencing 
analysis after depletion of ribosomal RNA. 
Exonic and intronic expression was mea- 
sured using this procedure for all UCSC 
genes with a detectable transcription 
(RPKM > 0.5, which is approximately the 
median RPKM of the samples). We quan- 
tified the fraction of intronic expression as 
intron RPKM/exon RPKM for each gene 
and found 1037 transcripts that display 
a marked increase in intronic expression in 
the KD compared with WT cells (KD/WT > 
2). This increase in intron inclusion for 
genes in KD cells could also be observed in 
the raw RNA-seq data visualized in the 
UCSC Genome Browser (Fig. 5A). Next we 
ascertained whether loss of CBX3 binding correlated with an in- 
crease in unspliced transcripts. We detected significant enrichment 
of CBX3 at genes with increased unspliced transcripts (Fig. 5B). 
Analysis of a replicate RNA-seq data set produced similar results, 
with significant overlap of genes found to have increased intronic 
inclusion (P< 1 X 10~ 16 ) and CBX3 enrichment (Supplemental Fig. 
S6). Our data demonstrate that genes bound by CBX3 in WT cells 
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recruiting splicing factors to the elon- 
gating polymerase (Shepard and Hertel 
2009; Lenasi and Barboric 2010; Licatalosi 
and Darnell 2010). Since CBX3 binds to 
RNA polymerase II, we posited that CBX3 
could also be playing a role in recruitment 
of the splicing complexes or regulators. 
Therefore, we performed ChIP to analyze 
splicing factor recruitment to a series of 
candidate genes in WT and KD cells. In- 
terestingly, we discovered that SNRNP70 
(U1-70K, 70K protein subunit of Ul) and 
SRSF1 (SF2/ASF, SR family protein) re- 
cruitment, to ILIA, is decreased upon 
CBX3 loss (Fig. 6A). This was not due 
to a decrease in the expression level of 
splicing factors in the KD cells, as the 
levels were comparable to those seen for 
WT HCT116 by Western blot (Supplemen- 
tal Fig. S7A). Notably, this phenomenon 
was limited to active CBX3-bound genes 
where we observed splicing defects (e.g., 
ILIA) and not control, non-CBX3 targets, 
such as NFKB1, that do not have splicing 
defects (Fig. 6B). Therefore, recruitment or 
stabilization of the spliceosome factors 
tested appears to be dependent upon CBX3 
localization to the gene in question. 
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Figure 4. Depletion of CBX3 causes a decrease in splicing only at CBX3 target genes. (A-D) RT-qPCR 
analysis of intron/exon expression ratios for intron-exon boundaries across candidate genes. The ratio 
was calculated by dividing intron expression, from intron-exon junction primer set, by exon expression, 
from intron spanning primer set. (A,B,D) RNA isolated from WT or KD HCT1 1 6 cells treated with TNF for 
30 min. (C) RNA isolated from untreated WT and KD cells. (A, left) Intron/exon expression ratios for all 
intron-exon boundaries across the IL8 gene. (Right) Fold change of intron/exon expression over WT for 
left panel. (B, top) Intron/exon expression ratios for TNF-responsive, CBX3 target genes EHD1 and LTB. 
(Bottom) Fold change of intron expression over WT. (C) Intron/exon expression ratios for HOXA9. (D) 
Intron/exon expression ratio for NFKB1, a TNF-responsive, non-CBX3 target gene. Expression levels of all 
samples were normalized to GAPDH and are the average of at least three biological replicate experi- 
ments. Error bars calculated as the standard deviation between replicates. (*) P < 0.05, (**) P < 0.01 . 



display an increase in splicing defects upon depletion of CBX3. 
Of the 367 genes down-regulated by the loss of CBX3, 85 also have 
an increase in unspliced transcripts. It could be hypothesized that 
the decrease in transcript level is due to degradation of unspliced 
transcripts. However, the significant number of genes with either 
splicing defects or a decrease in transcript level but not both suggests 
that separate mechanisms of action could be at play. 

CBX3 influences recruitment of splicing factors to actively 
transcribed genes 

The spliceosome is a large multi-subunit complex that is assem- 
bled in a highly ordered and cooperative manner to the RNA 
transcript. SR (serine and arginine-rich) proteins also bind to Ser2- 
phosphorylated RNA polymerase II and could play a role in 



Heterochromatin protein 1 has been 
implicated in a diverse array of cellular 
processes, including heterochromatin 
formation, gene silencing, DNA damage 
repair, telomere stability, and RNAi- 
mediated silencing (Hediger and Gasser 
2006; Kwon and Workman 2008; Dinant 
and Luijsterburg 2009; Zeng et al. 2010). 
Here, we provide evidence that in human 
cells CBX3 supports transcription and 
efficient splicing. We demonstrate that 
CBX3 is localized to the gene body of the 
most actively transcribed genes. We ob- 
serve that CBX3 preferentially associates 
with exons in multiple-exon genes. 
Knockdown of CBX3 expression results 
in down-regulation of a subset of target genes and significant ac- 
cumulation of unspliced transcripts from greater than 1000 tran- 
scripts. Depletion of CBX3 also causes a defect in recruitment of 
splicing factors SNRNP70 and SRSF1 to the CBX3 target gene ILIA. 
This is suggestive of a role for CBX3 in RNA processing through 
recruitment or stabilization of spliceosome components to at least 
a subset of target genes. 

Heterochromatin protein 1 family proteins are conserved 
from yeast to humans and are widely accepted as critical compo- 
nents of silenced/repressed chromatin (Singh and Georgatos 2002; 
Dillon 2004; Maison and Almouzni 2004; Huisinga et al. 2006; 
Lomberket al. 2006; Motamedi et al. 2008). However, in Drosophila 
both SU(VAR)205 and HPlc are also found at active loci (Fanti et al. 
2003; Greil et al. 2003; de Wit et al. 2007). In mammalian cells, 
CBX5 and -1 are predominantly heterochromatic but are known to 
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Figure 5. Loss of CBX3 causes RNA processing defects genome-wide. (A) UCSC Genome Browser 
snapshots showing representative RNA-seq data of CBX3 target genes BIRC5, HSPBl, and RPL18. Ex- 
pression tag counts in ln(x + 1) scale. Gene schematic showing exons (filled black box) and introns 
(hashed lines) below tracks. (B) ChlP-seq CBX3 enrichment (IP/input RPKM) at 5'end, coding region, 
and 3' end of genes having increased intron inclusion (unspliced) in KD versus WT HCT1 16 cells as 
compared to average CBX3 enrichment for all genes. Expression of introns and exons analyzed by RNA- 
seq in WT and KD cells. For box plots, the median is represented by the red line, the edges of the box 
represent the 25th and 75th percentiles, and the whiskers extend to non-outlier extreme data points. 



be recruited to specific promoters to silence transcription (Feldman 
et al. 2006; Yahi et al. 2008). In contrast, CBX3 was found to be 
more widely associated with euchromatin and is recruited to si- 
lence promoters by a variety of factors (Ogawa et al. 2002; Vicent 
et al. 2006; du Chene et al. 2007; Liu et al. 
2010). More recently CBX3 was also A 
found localized to active genes in a num- 
ber studies (Vakoc et al. 2005; Smallwood 
et al. 2007; Mateescu et al. 2008).This 
discovery sparked much debate and spec- 
ulation about the role of a heterochroma- 
tin factor in this context. Most recently 
a role for CBX3 in alternative splicing, 
at a select group of genes, has been sug- 
gested (Saint- Andre et al. 201 1). However, 
to date, no examination of CBX3 locali- 
zation or function genome-wide has been 
performed. Our study represents the first 
high-resolution genome-wide analysis of 
a mammalian heterochromatin family 
member. We found that CBX3 is pre- 
dominantly localized to the gene body of 
active genes in all the cell types exam- 
ined. While CBX3 binding is usually 
limited to between the transcription start 
and termination sites, there are examples 
of large domains of CBX3 that encompass 
multiple genes including the HOXA and 
KLK loci. CBX3 binding is strongly cor- 
related with the level of gene expression. 



Furthermore, we observe that CBX3 re- 
cruitment is inducible by extra-cellular 
signaling. 

It has been posited that CBX3 may 
act to attenuate the transcription out- 
put of its target genes (Hiragami and 
Festenstein 2005; Vakoc et al. 2005). Sup- 
porting this hypothesis, it was demon- 
strated in Drosophila that genes bound by 
SU(VAR)205 are expressed at lower levels 
(de Wit et al. 2005). We utilized a HCT116 
CBX3 knockdown cell line to probe the 
effect of CBX3 on gene expression in 
human cells. Knockdown of CBX3 only 
modestly affected global transcript level, 
with 367 genes being down-regulated and 
340 up-regulated (by more than twofold). 
Interestingly, only down-regulated genes 
were bound by CBX3, suggesting a posi- 
tive role for CBX3 regulating gene ex- 
pression. It should be noted, however, 
that the number of target genes affected 
by CBX3 depletion is small in compari- 
son to the total number of CBX3 target 
genes. While we cannot completely ex- 
clude the role of residual CBX3 in the KD 
HCT116 cells, this result led us to hy- 
pothesize that CBX3 could be playing 
another role aside from transcriptional 
regulation. Further analysis of targets 
revealed that CBX3 preferentially binds 
to longer, exon-rich genes. Interestingly, 
binding of CBX3 correlates with gene expression for all but single- 
exon genes. The lack of requirement for CBX3 at single-exon genes 
strongly suggested a role for CBX3 in splicing. In support of this 
hypothesis, the loss of CBX3 led to an increase in intron retention 
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Figure 6. CBX3 binds preferentially to exons and influences recruitment of splicing factors. (A,B) 
ChlP-qPCR for SNRNP70 and SRSF1 splicing factor binding to ILIA and NFKB genes in WT and KD 
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of at least three biological replicate experiments. Error bars calculated as the standard deviation between 
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in CBX3 target transcripts. Importantly, this is a genome- wide 
phenomenon, and genes that exhibit splicing defects in the 
absence of CBX3 are bound by CBX3 in WT cells. Therefore, 
decreased transcript levels of CBX3 target genes observed upon 
CBX3 depletion in HCT116 cells may result from their increased 
degradation due to improper splicing or from a direct effect of 
CBX3 on transcriptional output. 

Saint- Andre and colleagues (Saint-Andre et al. 2011) suggest 
a role for CBX3 in alternative splicing of CD44 in HeLa cells, 
whereby depletion of CBX3 led to a decreased inclusion of variant 
exons. Our study takes a wider unbiased approach to determine the 
effect of CBX3 depletion on transcriptional and splicing processes 
genome- wide. Therefore, in contrast to the previous study, we 
observed that loss of CBX3 caused a general increase in intron 
inclusion and loss of splicing and therefore resulted in accumula- 
tion of unspliced RNAs. However, GO analysis revealed that many 
of the unspliced transcripts that accumulate in KD cells are known 
to be alternatively spliced. It is therefore possible that CBX3 is 
playing multiple overlapping roles in splicing regulation. 

It has been previously suggested in Drosophila that SU(VAR)205 
could function to stabilize RNA transcripts (Piacentini et al. 2009). 
We performed RNA stability assays to determine if CBX3 affects 
splicing via RNA stabilization. HCT116 cells were treated with 
a-amanitin for 0-120 min, and the RNA was isolated at various 
time points. Levels of HOXA9, EZR, and EHF spliced and unspliced 
transcripts were determined by RT-qPCR using multiple indepen- 
dent primer sets (Supplemental Fig. S7B-D). A decrease in splicing of 
these transcripts was observed in response to CBX3 knockdown in 
HCT116 cells. We could detect no difference in RNA stability be- 
tween WT and KD cells; therefore, CBX3 must be functioning via a 
different mechanism in human cells. 

Most notably, we demonstrated that in human cells depleted 
of CBX3, recruitment of the RNA splicing factors SNRNP70 and 
SRSF1 was decreased significantly to ILIA. This loss of splicing 
factor recruitment was specific to CBX3 target genes. Levels of 
SNRNP70 and SRSF1 were unaffected at genes not bound by CBX3, 
such as NFKB1. While these data do not provide a direct role for 
CBX3 in splicing, the correlation between CBX3 occupancy and 
splicing defects does implicate CBX3 in the process. RNA poly- 
merase II plays a central role in the coupling of transcription, and 
splicing Ser-2 of the RNA polymerase II C-terminal domain (CTD) 
is phosphorylated during elongation, creating a binding platform 
for recruitment of splicing factors to the RNA template (Allemand 
et al. 2008; Tilgner and Guigo 2010). CBX3 also interacts with 
phosphorylated RNA polymerase II, raising the possibility that the 
CTD of polymerase could act as a platform to coordinate the action 
of CBX3 and components of the splicing machinery. The splicing 
factor SRSF1 binds to Ser-2 phosphorylated RNAP II, and this in- 
teraction aids recruitment of Ul and U2 snRNPs (Allemand et al. 
2008; Moore and Proudfoot 2009; Perales and Bentley 2009). We, 
and others, observe that CBX3 also associates with the elongating 
form of RNA polymerase II, thus providing a mechanism by which 
CBX3 may influence the recruitment and/or activity of splicing 
factors. 

Recently, Black and colleagues (Black et al. 2010) demon- 
strated that CBX3, along with a histone demethylase, plays a role 
in cell cycle regulation in Caenorhabditis elegans and human cells. 
Additionally, CBX3 has been implicated in controlling levels of 
primordial germ cells (PGCs) in mouse embryos via cell cycle reg- 
ulation and apoptosis (Abe et al. 2011). Overexpression of CBX3 
has been detected in multiple cancer cell lines and is thought to 
play a role in tumorigenesis (Takanashi et al. 2009). We performed 



GO term analysis of genes either down-regulated or with decreased 
RNA processing upon depletion of CBX3. We determined that 
CBX3 is responsible for regulating many genes involved in cell 
cycle progression, including CDK1 1A, CDC25C, DDX11,ATM, and 
CCNE1. In addition, loss of CBX3 altered expression of PIM1, 
CAM 2, and CCNE1, all of which are thought to be involved in 
cancer progression. 

Collectively, this study extends the previous observation that 
CBX3 binds to gene bodies by demonstrating that this is a genome- 
wide phenomenon. In addition, binding correlates with gene ac- 
tivity, in multiple cell types, and depletion of CBX3 leads to a de- 
crease in transcription for a subset of target genes. Importantly, this 
study begins to shed light on another role of CBX3 in this context: 
aiding in efficient processing potentially via splicing factor re- 
cruitment. Our data provide evidence that CBX3 may play a role 
in a variety of cellular processes through regulation of gene 
expression and splicing. 

Methods 

Generation of stable CBX3 knockdown HCT116 cells 

The CBX3 shRNA used in this study was designed using the 
pSicoOligomaker 1.5 algorithm (Jacks Laboratory, MIT). Oligonu- 
cleotides encoding the shRNA were annealed and cloned into the 
pLL3.7 lentiviral expression vector according to the Jacks Labo- 
ratory's protocol. Oligonucleotide sequences are as follows: for- 
ward oligo, 5 ' -TGAGGCAGAGCCTGAAGAATTTCAAGAGAATTC 
TTCAGGCTCTGCCTCTTTTTTC-3 '; reverse oligo, 5'-TCGAGA 
AAAAAGAGGCAGAGCCTGAAGAATTCTCTTGAAATTCTTCAGG 
CTCTGCCTCA-3'. Lentivirus production and transduction of 
HCT116 cells were carried out using protocols established in the 
Jacks Laboratory. Individual HCT116 cell clones expressing the 
CBX3 shRNA were isolated by limiting dilution, and the degree of 
CBX3 depletion in individual clones was ascertained by Western 
blot analysis. Clones 8 and 10 were selected for further study. 

Cell culture 

K562, MDA-MB-231, and HCT116 cells were grown in Iscove's 
Modified Dulbecco's, Leibowitz, and McCoys media, respectively, 
all supplemented with 10% FBS and 1% pen-strep. Cells were 
treated with either 10 ng/mL TNF for 30-60 min or IFNG for 30 
min. For ChIP, cells were crosslinked with 1% formaldehyde for 15 
min at 37°C and then harvested as previously described (Li et al. 
2003; Kim et al. 2005b). For expression analysis, RNA was isolated 
with TRIzol as per manufacturer's instructions. 

Chromatin immunoprecipitation 

ChIP was performed as previously described (Li et al. 2003; Kim 
et al. 2005a). Briefly, 2 |xg of specific antibodies was bound to Dynal 
magnetic secondary beads for 6 h. After washing, 200 |xg of soni- 
cated chromatin was incubated with the bead-bound antibody 
overnight. Beads were washed, and enriched DNA was isolated. 
ChIP DNA was then analyzed by hybridization either to custom- 
made DNA microarrays covering the ENCODE regions (Li et al. 
2003; Kim et al. 2005a) or to oligonucleotide microarray arrays 
covering the entire human genome (Roche NimbleGen) (Kim et al. 
2005b). The ChIP DNA was also sequenced after adapter ligation 
and amplification steps (Johnson e t al. 2007), as previously de- 
tailed. For qPCR analysis, samples and inputs were diluted eight- 
fold, and then enrichment of DNA was analyzed with primers 
specific to candidate genes (see Supplemental Materials). 
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ChlP-chip analysis 

Enriched DNA from ChIP experiments were prepped for hybrid- 
ization to Roche NimbleGen microarrays as previously described 
(Kim et al. 2005b). Briefly, 1 |xg each of immunoprecipitated or 
input genomic LM-PCR DNA was annealed with Cy5 and Cy3 end- 
labeled random prime nonamer oligonucleotides. Annealed DNA 
was then incubated with DNA polymerase Klenow fragment and 
dNTPs for 2 h at 3 7°C. The labeled samples were isolated by ethanol 
precipitation. Equal amounts of Cy5 -labeled immunoprecipitated 
DNA and Cy3 -labeled input DNA were mixed and hybridized to 
NimbleGen HD2 economy tiling arrays (hgl8 NimbleGen- Roche) 
using the MAUI Hybridization Station (BioMicro Systems) for 16- 
20 h at 42°C. The hybridized slides were washed twice in Wash 
buffer 1 (0.2X SSC, 0.2% SDS, and 0.1 mM DTT) and once each in 
buffer 2 (0.2 X SSC and 0.1 mM DTT) and buffer 3 (0.05% SSC and 
0.1 mM DTT) and then dried. The arrays were scanned on an Axon 
GenePix 4000B scanner (Axon Instruments) at 532 nm for Cy3 and 
635 nm for Cy5. We used MA2C package (Song et al. 2007) to do 
data normalization with default parameters (Robust C=2) and peak 
calling. 

ChlP-seq analysis 

ChlP-seq reads were mapped by the Eland to hgl8, keeping only 
uniquely mapping genes and discarding all but one of clonal reads 
(the number of monoclonal reads for each sample is included in 
Supplemental Table S4). Blocks of CBX3 enrichment over input 
were identified as previously described (Hawkins et al. 2010). To 
compute ChIP enrichment relative to input, we computed the 
RPKM of ChIP and input for each of the regions and divided ChIP 
RPKM by input RPKM. To avoid division by zero, a pseudo count of 
1 was added to each region before division. CBX3 -bound genes 
were defined as those having log 2 (CBX3 RPKM/ input RPKM) > 1 
and unbound genes log 2 (CBX3 RPKM/ input RPKM) < 0. Up/down- 
regulated genes were defined as those having median level of ex- 
pression in either WT or KD cells and that exhibit at least a twofold 
difference in expression. For exon enrichment, the RPKM of ChIP 
and input for 100-bp bins around exon 5' ends was calculated then 
and divided ChIP RPKM by input RPKM. Cassette exons were de- 
fined by the UCSC "knownalt" table from the alt events track. 

CBX3 coimmunoprecipitation 

Four micrograms of CBX3 antibody (Millipore 05-690) or R-IgG 
was incubated with 20 |xL of Protein G beads in lx PBS/5 mg/mL 
BSA for 2 h at 4°C. Unbound antibody was removed by washing 
with 1 X PBS/5 mg/mL BSA. Antibody-bound beads were incubated 
with 6 mg of nuclear or nuclear pellet extract for 4 h at 4°C. Un- 
bound factors were removed by washing three times with 300 mM 
Buffer D (300 mM KCL, 20 mM HEPES at pH 8.0, 0.1 mM EDTA, 
20% glycerol) and once with 100 mM Buffer D. Beads were boiled 
in 50 (jlL of SDS loading dye for 10 min prior to SDS-page gel 
electrophoresis. Factors bound to CBX3 were analyzed by Western 
blot. 



GST-CBX3 purification 

CBX3 protein and GST-CBX3 beads were purified as previously 
detailed (Vassallo and Tanese 2002; Smallwood et al. 2007). GST- 
CBX3 proteins were purified by expression in bacteria and en- 
richment on glutathione columns. Proteins were released from the 
GST tag and beads by thrombin cleavage. The concentration of 
CBX3 or GST-CBX3 was analyzed by SDS-PAGE and Coomassie 
staining. 



Peptide pulldowns 

Interaction of soluble nuclear factors with immobilized peptides 
was performed as previously described (Wysocka et al. 2005; Sims 
et al. 2006). In short, 5 |xg of biotin tagged histone tail peptide was 
bound to 25 |xL of streptavidin coated beads for 15 min at room 
temperature in 5 mM Tris (pH 8.0), 0.5 mM EDTA, and 1 M NaCl. 
Unbound peptides were washed away by washing three times with 
100 mM Buffer D. Beads were incubated with 300 ng-1 |xg of pu- 
rified CBX3, in 300 |xL of 100 mM Buffer D, for 2 h at 4°C. Un- 
bound factors were removed by three washes with 300 mM Buffer 
D. CBX3 associated with the peptides was eluted by boiling in 2% 
SDS and analyzed by SDS-page electrophoresis and Western blotting. 

RT-qPCR 

Total RNA was isolated by TRIzol extraction, DNase 1 treated, and 
then subjected to cDNA synthesis using Invitrogen Super script kit 
as per the manufacturer's instructions. qPCR analysis was per- 
formed using 0.5 (jlL of cDNA as template in 20 \xL reaction vol- 
umes using Sybr green master mix from Roche. All results were 
normalized to GAPDH expression. A no-RT control was included in 
each experiment to ensure results were not from contaminating 
DNA. Analysis shown is from at least three biological replicates for 
each sample run in triplicate on each qPCR plate; error bars cor- 
respond to the standard deviation of the biological replicates. 
P- values were calculated, by t-test, from the biological replicates. 
Specific primers used are listed in the Supplemental Material. 

RNA-seq 

Ten micrograms of total RNA was depleted of ribosomal RNA using 
the Invitrogen Ribominus kit. The resulting RNA fraction was 
subjected to first-strand cDNA synthesis using random primers in 
the presence of actinomycin D. Samples were cleaned up by ap- 
plying reaction mix to mini G50 columns. The eluted sample was 
then subjected to second-strand synthesis with DNA polymerase I, 
DNA ligase, and dUNTPs in the presence of RNase H. This ensures 
that the cDNA strand complementary to the mRNA has dNTPs and 
that the other strand is specifically labeled with dUNTPs. The 
cDNA was purified by QIA-quick-QG columns and then sonicated 
using a Bioruptor and fragment size assessed by mini-gel electro- 
phoresis. The fragmented cDNA was subsequently end repaired, 
A-tailed, and then ligated to adapters. After Qiagen column purifi- 
cation, the samples were size-selected on a 2% high-resolution 
agarose gel. Fragments corresponding to 175-350 bp were excised 
from the gel and collected by Qiagen column purification. The 
fragments were then treated with UDGase to remove the labeled 
strand, and then the mRNA complementary cDNA strand was 
amplified by PCR and subjected to sequencing analysis using the 
Illumina GAIL Reads were mapped by bowtie to hgl8, keeping 
only uniquely mapping genes and discarding all but one of clonal 
reads. RPKM was calculated as the number of reads per kilo base of 
exon model per million reads mapped. 

Data access 

Sequencing data have been submitted to the NCBI Gene Expres- 
sion Omnibus (GEO) (http://www.ncbi.nlm.nih.gov/geo/) under 
accession number GSE28115. 
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